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Abstract —The inability of automated edge detection methods 
inspired from primal sketch models to accurately calculate object 
edges under the influence of pixel noise is an open problem. 
Extending the principles of image perception i.e. Weber-Fechner 
law, and Sheperd similarity law, we propose a new edge detection 
method and formulation that use perceived brightness and neigh¬ 
bourhood similarity calculations in the determination of robust 
object edges. The robustness of the detected edges is benchmark 
against Sobel, SIS, Kirsch, and Prewitt edge detection methods 
in an example face recognition problem showing statistically 
significant improvement in recognition accuracy and pixel noise 
tolerance. 

Index Terms —Edge detection, Primal sketch model, Perceived 
brightness, Local stimuli. 

I. Introduction 

HE concept of spatial change detection is interrelated to 
the primal sketch theory CO, 13 , 0 where the change in 
the intensities over a visual field is taken as feature than pixel 
intensities themselves. The primal sketch relates the geometry 
of the image intensity changes in the images to the early 
psychophysical mechanism of human eye. These edge models 
reflect to the general idea of spatial change detection (4) from 
the neighbourhood of the pixel under consideration that find 
application as a feature preprocessing stage in general image 
bases pattern recognition problems 151 , || 51 . 

This paper presents a noise robust edge detection method 
using the local stimuli computed from the amount perceived 
brightness at each image location for enhancing the quality of 
image inherent structural and intensity edge information. The 
proposed method computes a relative numerical quantification 
of the physical stimuli contributed to the image formation. 
Such computed local stimuli helps to extract and preserve 
underlying structural edge details. The quality of the detected 
edges are demonstrated through examples of simulations with 
hypothetical edges and through standard face recognition 
problem. 

II. Background 

In digital images, object details are analyzed with the help of 
responses computed for intensity changes at its boundaries that 
are referred to as the edges 0. This primary idea is inspired 
by the biological vision processing of human eye and in its 
early form is formulated by primal sketch model 0 describes 
an image using the image inherent structures based on the 
response computed with respect to edge formations using 
optimal smoothing filters and detection of intensity changes. 
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The primary criteria of any edge detection technique include 
its ability to provide best response to edges, good localization, 
continuity of edges and tolerance to image noise cum natural 
variance within the same regions. A comprehensive description 
of an image inevitably requires a robust retrieval of edge 
information by satisfying the criteria for edge detection. The 
quality of image description based on primal sketch model 
can be mathematically modeled with help of sketchable and 
unsketchable edge responses m. The term sketchability is 
meant for useable edges and unsketchable edge responses 
are referred to the false edges formed due the intra-region 
variability together with the image noise. Primal structures 
retrieved comprising of all sketchable edges can be described 
as the ideal edge detection technique. In real world images, it 
is not possible to perform an ideal detection, but to optimize 
for minimal trade-off between sketchable and unsketchable 
edges the best efforts are made by applying smoothing as 
the first stage of edge detection. Unfortunately, this will 
contribute to reduction of many of the fine details due to 
the averaging operations used to implement the smoothing. 
This effect can be controlled up to certain degrees, by using 
specialized structure preserved denoising techniques such as 
non-local means filter m, noise suppression filters h, m, 
and anisotropic diffusion filtering [11]). On the other hand, 
hypothetically, an edge response computation with inherent 
intra-region variability suppression can lead to more robust 
edge detection approaches. In this paper, the focus is on the 
suppression of intensity variability other than edges in order 
to minimizes the unsketchable primitives. 

III. Proposed Method 

We propose to make use of Weber-Fechner law Ml to 
mathematically compute the amount of perceived brightness 
and the corresponding local stimuli. The aim is to enhance 
the inherent image features by weighting the original image 
using the magnitude of local stimuli resulting in the image 
formation. This magnitude of local stimuli is computed by 
finding the local variation in perceived brightness at the 
respective locations. Fechner law states that the amount of 
perceived brightness/loudness is equal to the logarithm the 
measured intensity of the physical stimulus. Thus, the per¬ 
ceived brightness, B p for a given image /, is expressed as 

B p = k log 10 (J) (1) 

where k is a constant. 

The gradients of the perceived brightness would represent 
the edge information in the image. Since the gradient itself 
is a linear operator the noisy pixel will not be suppressed by 
this operation. In addition, the gradient is a form of distance 
measure in context of measuring the localised inter-pixel 
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variability. While there exists several of distance measures 
that are suitable to be used for detecting the spatial change 
a, we propose to use Shepherd’s similarity measure UM 
that would give a measure of perceived similarity on the 
measured distance. In addition to the psychological dimension 
of the Shepherd’s similarity, this measure has the ability to 
suppress the outliers in the distance calculations due to the 
exponential nature of the transform. Applying this theory to 
practise, localized spatial stimuli magnitude is numerically 
quantified as the net variation of perceived brightness along x 
and y directions. The variation of B along x (V x ) and y (V y ) 
directions is computed as a function of respective gradients 
(g x and g y ) and is expressed as: 

V x = g x .exp(-\g x \) ■ V y = g y .exp(-\g y \) (2) 

The magnitude of local stimuli (I/) can be computed using 
Eq. 3. 

v = ^ + V* (3) 

The inherent image features by pixel wise weighting of 
original image (Fig. la) is computed using Eq. 3 to obtain 
the local stimuli magnitude mapj ( V ) (Fig. lb). 


(b) 

Fig. 1. (a) Original image I, (b) Local stimuli magnitude map V 


Algorithm 1 Local Stimuli Template 

l: procedure LST (I) 

2: Read image, I. 

3: B <— k log(/) // Perceived brightnes^l 

4: [9x,9y] gradient(B). // Gradient operation, using 

central differencing scheme. 

5: V x g x exp(— \g x \) II Local change in brightness 

(stimuli) along x axis. 

6: V y <— g y exp(— \g y \) //Local change in brightness 

(stimuli) al ong y axis . 

7: V <— \Jv% + Vy //Local stimuli map. 

8 : end procedure 


Algorithm 1 provides the formal summary of the proposed 
feature enhancement process based on local stimuli weighting. 
Figure 2 shows a step wise graphical illustration of algorithm 
1 and indicates the proposed methods ability to detect and 
amplify edge signals in step 3-4. The proposed method retrieve 
edges by computing the local stimuli experienced at each pixel 
location with the help of the amount of brightness perceived 

'it may be noted that the alternate form V x = \g x \(l — e~\ 9x \) and 
V y = |pj/|(l — will be also able to amplify edges and suppress the 

noise, however, will have difficulty identifying slow varying and low contrast 
edges. 

2 The scaling factor k is set as 1 in our experiments. The sample implemen¬ 
tation of the code in MATLAB can be obtained by contacting the authors. 


to form the image. The local stimuli is computed as the two 
dimensional variations in perceived brightness. The natural 
variability in computed brightness is observed to be less; 
at the same time edge information is sufficient to compute 
the responses. Also exponential weights are introduced to the 
filtering scheme, which further helps in the suppression of 
intra-region intensity variability. 



Fig. 2. A graphical illustration on the working of the proposed edge detection 
method. 

Figure 3 illustrates the comparison of the proposed method 
for the amount of underlying structural information extracted 
with that of the existing methods such as Sobel, SIS, Kirsch, 
and Prewitt m for a sample face image. The images visually 
illustrate that the proposed method produces meaningful edge 
features compared to other methods, where the binary edges 
are obtained using the adaptive Ostu’s threshold technique fl5l . 
Proposed method shows least spurious structures with much 
better distinction from corresponding background. Edge re¬ 
sponses computed for a set of images with diverse imaging 
conditions from Berkeley Segmentation Dataset is displayed 
in Fig. 4. 



(a) Sobel (b) SIS (c) Kirsch (d) Prewitt (e) Proposed 



(f) Sobel (g) SIS (h) Kirsch (i) Prewitt (j) Proposed 


Fig. 3. Detected edges and binary maps computed for visualizing edge 
response (a-e) and the underlying structural details (f-j) of original image 
extracted using various methods namely Sobel, SIS, Kirsch and Prewitt, in 
comparison to the proposed method 

IV. Experimental Results 

For better understanding, the responses to edge computed 
for classical primal sketch modefl and proposed scheme for 
various shapes are shown in Table IIIIIIIII1 Table U illustrates 
the edges computed for varying gaussian noise (cr^) in the 

3 http://www.eecs.berkeley.edu/Research/Projects/CS/vision/bsds/ 

4 Primal Sketch model results shown are based on [2], theory of edge 
detection by Marr, where an edge is defined as the zero crossing along region 
transitions. And to display the fundamental model of edge detection, we use 
second order operation represented by Laplacian operator. 



















IEEE SIGNAL PROCESSING LETTERS, 2015 


3 



A 



(a) 

(b) (c) (d) 

(e) 

j $£ 


pi 

■ 1 



(f) (g) (h) (i) Ci) 

Fig. 4. Edge responses computed for natural scenes shared by Berkeley 
Segmentation Dataset, with original images (a-e) and corresponding edge 
responses (f-j). 


order of 0%, 5%, 10%, and 20% of the maximum of image 
intensities for step-edge. From results, primal sketch model 
based scheme fails to detect edges beyond 5% increase in 
variability, whereas the proposed method is proves to be 
capable of detecting edges even at 20% intensity variability. 
This indicates the proposed method’s capability in extracting 
maximum number of sketchable edges with minimal unsketch- 
able details. 

Table HU illustrates the response on Gaussian shaped in¬ 
tensity changes with increasing intensity variability. Results 
depicts that the primal sketch model without smoothing fails 
even with slight increase in variability. Whereas, the proposed 
method shows a good tolerance against increasing intensity 
variability with a comparable edge localization. Table HXU 
depicts spatial edge response for a ramp shape, where it can 
be observed that variability tolerance for method schemes are 
very poor, whereas the proposed method is shown to be more 
robust to pixel noise. 

TABLE I 

Comparison of Edge Responses computed using a step shape for 

VARYING PERCENTILE STANDARD DEVIATIONS IN REFERENCE TO THE 
MAXIMUM INTENSITY VALUE 
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Face images are used to study the quality of the edges de¬ 
tected by our proposed method in a realistic imaging condition. 
Performance evaluation of the proposed method is quantified 
using percentage correctness, with the test data sets se- 


5 The objective of the experiments are intended to objectively look at the 
quality of the edge features rather than with studying or comparing with 
existing face recognition methods. 


TABLE II 

Comparison of Edge Responses computed using a Gaussian 

SHAPE FOR VARYING PERCENTILE STANDARD DEVIATIONS IN REFERENCE 
TO THE MAXIMUM INTENSITY VALUE 
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TABLE III 

Comparison of Edge Responses computed using a ramp shape 

FOR VARYING PERCENTILE STANDARD DEVIATIONS IN REFERENCE TO THE 
MAXIMUM INTENSITY VALUE 
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lected from face databases namely, ORlfi AF0, GeorgiaTech 
(GTechfl and Japanese Female Facial Expression (JAFFEjE 
These databases cover a wide range of imaging conditions. 
Unless otherwise specified, the proposed method is evaluated 
against other commonly used edge detection methods such 
as Prewitt, Kirsch, SIS and Sobel using a minimum distance 
nearest neighbour classifier. 

The recognition accuracy and area under receiver operator 
characteristics curve, for varying size of the training set is 
illustrated with the help of Fig. fusing average performances 
from databases ORL, AR, GTech, and JAFFE. In a general, 
the proposed method shows a consistent improvement in 
robustness over the other methods. 

The comparisons of average recognition accuracy on differ¬ 
ent classifiers are tabulated for proposed method benchmark 
against Kirsch, Prewitt, SIS, and Sobel edge detection methods 
in Tables IV using databases ORL, AR, GTech, and JAFFE. 

6 ORL face images are acquired at different times with varying the lighting, 
facial expressions (open / closed eyes, smiling / not smiling) and facial 
details (glasses / no glasses). The background used is a dark homogeneous 
background with the subjects in an upright, frontal position (with tolerance 
for some side movement). 

7 AR Face Database represents different image conditions such as facial 
expressions, illumination conditions, and occlusions (sun glasses and scarf) 

8 Georgia Tech (GTech) face images show frontal and/or tilted faces with 
different facial expressions, lighting conditions and scale. 

9 JAFFE database contains 213 images of 7 facial expressions (6 basic 
facial expressions + 1 neutral) posed by 10 Japanese female models. Each 
image has been rated on 6 emotion adjectives by 60 Japanese subjects. 
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Fig. 5. Average variation in accuracy (%) (a), and area under ROC (b) for 
varying sizes of training set (%) computed for ORL, AR, Georgia Tech, and 
JAFFE face databases calculated using a lazy classifier. 


The classification experiments are conducted by dividing the 
database into equal sized sets of training and test images. The 
classifiers used in the recognition experiments are IB-k ltT6l . 
NNgelT7l. random forest [IH , and SMO fl9l . The results indi¬ 
cate statistically significant improvement in robustness of the 
proposed edge features over the existing edge models. 

TABLE IV 

Average recognition accuracy (%) for different classifiers 
USING FACE DATABASES ORL, AR, GEORGIA TECH, AND JAFFE 


V. Conclusion 

In this paper, we presented the idea of spatial stimuli 
gradient sketch model inspired from the Fechner and Weber 
law, and Sheperd similarity measure that enables relationships 
between the image intensity and psychological measurement 
space. The use of these concepts results in edges that are 
robust to noise in pixels intensity along the edges of different 
nature and overcome the limitations of edges based on primal 
sketch models. The robustness of the method is illustrated 
with the help of face recognition experiments on some of 
the standard benchmark databases that cover the wide range 
of possible edges in the digital images. The use of proposed 
method showed a statistically significant improvement in the 
recognition accuracies over the benchmark edge detection 
methods. The comparison shown in Fig 6 indicate that while 
almost every benchmark edge detection methods fail to work 
as useful features in a face recognition problem, the proposed 
edges show a higher level of tolerance to pixel noise levels. In 
future, the proposed spatial stimuli gradient sketch model can 
be used to extract more useful features for a wide range of 
problems such as image registration, image recognition, and 
image segmentation, where the use of edges is proven to be 
useful and higher levels of abstraction shown to improve the 
performances. 


Method 

Database 

IBk 

NNge 

RandomForest 

SMO 

Proposed 

76.1±6.4 

67.0±9.2 

41.4±9.3 

80.9±6.1 

Kirsch 

46.8±8.5 

41.6±9.7 

32.2±8.4 

62.02±8.9 

Prewitt 

54.3±8.2 

44.9±7.8 

34.4±7.2 

61.6±8.1 

SIS 

31.2±8.2 

25.6±7.2 

20.1±8.9 

39.5±8.8 

Sobel 

63.6±7.5 

50.1±11.1 

40.7±10.5 

77.1±8.5 


Proposed 



Intensity Variance (%) 

Fig. 6. Average accuracy computed from example AR face database illus¬ 
trating the impact of percentage intensity variability in pixels. Here, x axis 
represents the maximum additive noise variance per pixel. 

The proposed edge model contribute to reducing intra- 
regional intensity variance that in turn reflects improvements 
in the recognition accuracy. For illustrating the effect, face 
databases are added with additive noise variance in increasing 
order. Figure 6 show significantly better tolerance towards in¬ 
tensity variance in comparison to other methods. The proposed 
theory preserves the edge information using fundamental prin¬ 
ciples from the psychology of vision processing. While the 
noise itself is not substantially removed as seen from Table 
maninn and Fig 5, 6, while it does indicate that edges are 
preserved and that it can be processed further using filtering 
schemes to retrieve edges close to its non-noisy state. 
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